FUSE: Multi-faceted Set Expansion by Coherent Clustering of Skip-Grams
نویسندگان
چکیده
Set expansion aims to expand a small set of seed entities into complete relevant entities. Most existing approaches assume the input is unambiguous and completely ignore multi-faceted semantics As result, given {"Canon", "Sony", "Nikon"}, previous models return one mixed that are either Camera Brands or Japanese Companies. In this paper, we study task expansion, which capture all semantic facets in multiple sets entities, for each facet. We propose an unsupervised framework, FUSE, consists three major components: (1) facet discovery module: identifies entity by extracting clustering its skip-grams, (2) fusion discovers shared entire optimization formulation, (3) expands utilizing masked language model with pre-trained BERT models. Extensive experiments demonstrate FUSE can accurately identify generate quality
منابع مشابه
Modeling Harmony with Skip-Grams
String-based (or viewpoint) models of tonal harmony often struggle with data sparsity in pattern discovery and prediction tasks, particularly when modeling composite events like triads and seventh chords, since the number of distinct n-note combinations in polyphonic textures is potentially enormous. To address this problem, this study examines the efficacy of skip-grams in music research, an a...
متن کاملProtein classification using modified n-grams and skip-grams.
Motivation Classification by supervised machine learning greatly facilitates the annotation of protein characteristics from their primary sequence. However, the feature generation step in this process requires detailed knowledge of attributes used to classify the proteins. Lack of this knowledge risks the selection of irrelevant features, resulting in a faulty model. In this study, we introduce...
متن کاملReady…set…fuse
In This Issue In This Issue Ready…set…fuse he fusion of two lipid membranes underlies a huge range of biological phenomena, from infection by enveloped viruses to the secretion of cellular proteins , but a central aspect of membrane fusion has remained mysterious: do the T proteins that mediate fusion act cooperatively or independently? On page 833, Markovic et al. argue that teamwork is the or...
متن کاملA Unified Learning Framework of Skip-Grams and Global Vectors
Log-bilinear language models such as SkipGram and GloVe have been proven to capture high quality syntactic and semantic relationships between words in a vector space. We revisit the relationship between SkipGram and GloVe models from a machine learning viewpoint, and show that these two methods are easily merged into a unified form. Then, by using the unified form, we extract the factors of the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-67664-3_37